CL3D: Camera-LiDAR 3D Object Detection With Point Feature Enhancement and Point-Guided Fusion

نویسندگان

چکیده

Camera-LiDAR 3D object detection has been extensively investigated due to its significance for many real-world applications. However, there are still of great challenges address the intrinsic data difference and perform accurate feature fusion among two modalities. To these ends, we propose a two-stream architecture termed as CL3D, that integrates with point enhancement module, point-guided module IoU-aware head cross-modal detection. Specifically, pseudo LiDAR is firstly generated from RGB image, (PEM) then designed enhance raw point. Moreover, (PFM) developed find image-point correspondence at different resolutions, incorporate semantic geometric features in point-wise manner. We also investigate inconsistency between localization confidence classification score detection, introduce prediction (IoU Head) box regression. Comprehensive experiments conducted on publicly available KITTI dataset, CL3D reports outstanding performance compared both single- multi-modal detectors, demonstrating effectiveness competitiveness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

LiDAR-Camera Calibration using 3D-3D Point correspondences

With the advent of autonomous vehicles, LiDAR and cameras have become an indispensable combination of sensors. They both provide rich and complementary data which can be used by various algorithms and machine learning to sense and make vital inferences about the surroundings. We propose a novel pipeline and experimental setup to find accurate rigid-body transformation for extrinsically calibrat...

متن کامل

Fusing Bird View LIDAR Point Cloud and Front View Camera Image for Deep Object Detection

We propose a new method for fusing a LIDAR point cloud and camera-captured images in the deep convolutional neural network (CNN). The proposed method constructs a new layer called non-homogeneous pooling layer to transform features between bird view map and front view map. The sparse LIDAR point cloud is used to construct the mapping between the two maps. The pooling layer allows efficient fusi...

متن کامل

Stereo Image Point Cloud and Lidar Point Cloud Fusion for the 3d Street Mapping

Combining active and passive imaging sensors enables creating a more detailed 3D model of the real world. Then, these 3D data can be used for various applications, such as city mapping, indoor navigation, autonomous vehicles, etc. Typically, LiDAR and camera as imaging sensors are installed on these systems. Both of these sensors have advantages and drawbacks. Thus, LiDAR sensor directly provid...

متن کامل

Feature enhancing aerial lidar point cloud refinement

Raw aerial LiDAR point clouds often suffer from noise and under-sampling, which can be alleviated by feature preserving refinement. However, existing approaches are limited to only preserving normal discontinuous features (ridges, ravines and crest lines) while position discontinuous features (boundaries) are also universal in urban scenes. We present a new refinement approach to accommodate un...

متن کامل

Robust Curb Detection with Fusion of 3D-Lidar and Camera Data

Curb detection is an essential component of Autonomous Land Vehicles (ALV), especially important for safe driving in urban environments. In this paper, we propose a fusion-based curb detection method through exploiting 3D-Lidar and camera data. More specifically, we first fuse the sparse 3D-Lidar points and high-resolution camera images together to recover a dense depth image of the captured sc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Intelligent Transportation Systems

سال: 2022

ISSN: ['1558-0016', '1524-9050']

DOI: https://doi.org/10.1109/tits.2022.3154537